AITopics | black and white photo

Collaborating Authors

black and white photo

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Measuring similarity between embedding spaces using induced neighborhood graphs

Tavares, Tiago F., Ayres, Fabio, Smaragdis, Paris

arXiv.org Artificial IntelligenceNov-13-2024

Deep Learning techniques have excelled at generating embedding spaces that capture semantic similarities between items. Often these representations are paired, enabling experiments with analogies (pairs within the same domain) and cross-modality (pairs across domains). These experiments are based on specific assumptions about the geometry of embedding spaces, which allow finding paired items by extrapolating the positional relationships between embedding pairs in the training dataset, allowing for tasks such as finding new analogies, and multimodal zero-shot classification. In this work, we propose a metric to evaluate the similarity between paired item representations. Our proposal is built from the structural similarity between the nearest-neighbors induced graphs of each representation, and can be configured to compare spaces based on different distance metrics and on different neighborhood sizes. We demonstrate that our proposal can be used to identify similar structures at different scales, which is hard to achieve with kernel methods such as Centered Kernel Alignment (CKA). We further illustrate our method with two case studies: an analogy task using GloVe embeddings, and zero-shot classification in the CIFAR-100 dataset using CLIP embeddings. Our results show that accuracy in both analogy and zero-shot classification tasks correlates with the embedding similarity. These findings can help explain performance differences in these tasks, and may lead to improved design of paired-embedding models in the future.

nng, representation, similarity, (15 more...)

arXiv.org Artificial Intelligence

2411.08687

Country:

South America > Brazil > São Paulo (0.04)
North America > United States > Illinois > Champaign County > Champaign (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
(3 more...)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.86)

Add feedback

Bounding and Filling: A Fast and Flexible Framework for Image Captioning

Ma, Zheng, Wang, Changxin, Huang, Bo, Zhu, Zixuan, Zhang, Jianbing

arXiv.org Artificial IntelligenceOct-15-2023

Most image captioning models following an autoregressive manner suffer from significant inference latency. Several models adopted a non-autoregressive manner to speed up the process. However, the vanilla non-autoregressive manner results in subpar performance, since it generates all words simultaneously, which fails to capture the relationships between words in a description. The semi-autoregressive manner employs a partially parallel method to preserve performance, but it sacrifices inference speed. In this paper, we introduce a fast and flexible framework for image captioning called BoFiCap based on bounding and filling techniques. The BoFiCap model leverages the inherent characteristics of image captioning tasks to pre-define bounding boxes for image regions and their relationships. Subsequently, the BoFiCap model fills corresponding words in each box using two-generation manners. Leveraging the box hints, our filling process allows each word to better perceive other words. Additionally, our model offers flexible image description generation: 1) by employing different generation manners based on speed or performance requirements, 2) producing varied sentences based on user-specified boxes. Experimental evaluations on the MS-COCO benchmark dataset demonstrate that our framework in a non-autoregressive manner achieves the state-of-the-art on task-specific metric CIDEr (125.6) while speeding up 9.22x than the baseline model with an autoregressive manner; in a semi-autoregressive manner, our method reaches 128.4 on CIDEr while a 3.69x speedup. Our code and data is available at https://github.com/ChangxinWang/BoFiCap.

boficap-sa, fast and flexible framework, proc, (14 more...)

arXiv.org Artificial Intelligence

2310.09876

Country:

North America > United States > Pennsylvania (0.04)
Asia > China > Jiangsu Province > Nanjing (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

A Simple Zero-shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image Models

Allingham, James Urquhart, Ren, Jie, Dusenberry, Michael W, Gu, Xiuye, Cui, Yin, Tran, Dustin, Liu, Jeremiah Zhe, Lakshminarayanan, Balaji

arXiv.org Artificial IntelligenceJul-15-2023

Contrastively trained text-image models have the remarkable ability to perform zero-shot classification, that is, classifying previously unseen images into categories that the model has never been explicitly trained to identify. However, these zero-shot classifiers need prompt engineering to achieve high accuracy. Prompt engineering typically requires hand-crafting a set of prompts for individual downstream tasks. In this work, we aim to automate this prompt engineering and improve zero-shot accuracy through prompt ensembling. In particular, we ask "Given a large pool of prompts, can we automatically score the prompts and ensemble those that are most suitable for a particular downstream dataset, without needing access to labeled validation data?". We demonstrate that this is possible. In doing so, we identify several pathologies in a naive prompt scoring method where the score can be easily overconfident due to biases in pre-training and test data, and we propose a novel prompt scoring method that corrects for the biases. Using our proposed scoring method to create a weighted average prompt ensemble, our method outperforms equal average ensemble, as well as hand-crafted prompts, on ImageNet, 4 of its variants, and 11 fine-grained classification benchmarks, all while being fully automatic, optimization-free, and not requiring access to labeled validation data.

large language model, machine learning, simple zero-shot prompt weighting technique, (17 more...)

arXiv.org Artificial Intelligence

2302.06235

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
South America > Colombia > Meta Department > Villavicencio (0.04)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)

Genre: Research Report (0.82)

Industry:

Media (1.00)
Leisure & Entertainment (1.00)
Health & Medicine > Diagnostic Medicine (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

Google AI recreates Gustav Klimt paintings destroyed during WWII

#artificialintelligenceOct-10-2021, 17:50:16 GMT

Gustav Klimt created some of the world's most expensive masterpieces, but around 20% of his artworks have been lost. Among them are the so-called Faculty Paintings: Philosophy, Medicine, and Jurisprudence. The three pieces are believed to have been destroyed in a fire during World War Two. Only black and white photos of the artworks remain. The original paintings may never be seen again, but machine learning has come close to bringing them back to life.

algorithm, artwork, klimt, (9 more...)

#artificialintelligence

Country: Europe > Austria > Vienna (0.06)

Industry: Government > Military (1.00)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

NASA's lunar probe snaps eerie black and white image of Jupiter and two of its moons

Daily Mail - Science & techOct-6-2021, 14:04:59 GMT

NASA's Lunar Reconnaissance Orbiter - focused on observing the moon in preparation for humanity heading back to the celestial satellite - has snapped an eerie black and white photo of Jupiter and two of its moons. The LRO, which launched in June 2009, snapped the image of Jupiter and its moons, Io and Europa from 390 million miles away. The spacecraft sits roughly 62 miles (100km) above the surface of the moon, which is 239,000 miles from Earth. Given the extreme distance between the moon and the gas giant and the fact that the LRO is'aging' according to a statement, the image is a feat of technological strength. NASA's Lunar Reconnaissance Orbiter has snapped a black and white photo of Jupiter and two of its moons, Io and Europa (circled in red above) 'Because the Lunar Reconnaissance Orbiter spacecraft is aging (LRO launched over 12 years ago), it now only uses its two star trackers to keep tabs on where it is pointed, rather than its inertial measurement unit, which adds complications to imaging anywhere but straight down at the lunar surface (we don't want the star trackers pointed at the Moon rather than the stars!),' Brett Denevi, deputy principal investigator for the LRO Camera, said in a statement.

eerie black and white image, jupiter, lunar probe snap eerie black, (9 more...)

Daily Mail - Science & tech

Country: North America > United States (0.83)

Industry:

Government > Space Agency (0.94)
Media > Photography (0.92)
Government > Regional Government > North America Government > United States Government (0.83)

Technology: Information Technology > Artificial Intelligence (0.31)

Add feedback

AI photo tool 'simulates travelling back in time with a modern camera'

Daily Mail - Science & techApr-15-2021, 11:16:37 GMT

US researchers have created a photo colourising tool that uses artificial intelligence (AI) to create eerily lifelike images of deceased historical figures.

historical figure, lincoln, modern camera, (10 more...)

Daily Mail - Science & tech

Country:

North America > United States (0.31)
Asia > Middle East > Israel (0.05)

Industry: Consumer Products & Services > Travel (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.33)

Add feedback

Deep Learning based image colorization with OpenCV - CV-Tricks.com

#artificialintelligenceMay-25-2019, 06:48:58 GMT

In India, we celebrated the festival of color "Holi" last week. We celebrate the end of the winter with a splash of color because that's what the spring will bring us in a few days. When I was young, the celebrations were sparse. It was the decade of frugal parenting. We waited for festivals so eagerly because it meant parent approved outing and fun.

black and white photo, color space, image colorization, (11 more...)

#artificialintelligence

Country: Asia > India (0.25)

Industry: Media > Photography (0.55)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Bringing black and white photos to life using Colourise.sg

#artificialintelligenceFeb-5-2019, 13:42:01 GMT

While it is impossible to replicate the exact conditions in which the original photo was taken, it is possible to add colour to the photo to help us imagine what the photographer could have seen in that instant. It is incredible -- almost magical -- how a little bit of colour can bring us that much closer to that specific moment in time. And as such, for our hackathon in January, our team decided to build a deep learning colouriser tool trained specifically for old Singaporean photos. If you have old black and white photos and would like to colourise them, you can do so here: Colourise.sg. We do not store any of the photos that you upload to our colouriser application.

artificial intelligence, machine learning, social media, (4 more...)

#artificialintelligence

Industry: Media > Photography (1.00)

Technology:

Information Technology > Communications > Social Media (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)

Add feedback

The Key Differences Between Machine Learning and Artificial Intelligence

#artificialintelligenceOct-7-2018, 08:58:03 GMT

Machine learning and artificial intelligence (known as A.I.) both sound like futuristic terms for some dystopian future where robots take over the planet. There are lots of similarities and there is much overlap between different types of computer automated learning, inference, and autonomy, and each one comes with its own set of pros and cons. Sci-fi movies aside, there are lots of important differences between deep learning, machine learning, and artificial intelligence that highlight the different ways in which they work and the different applications they're best suited for. Here's what you need to know. This is the earliest and most broad term for computers acting on their own.

artificial intelligence, computer, machine learning, (11 more...)

#artificialintelligence

Industry: